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Base Application 

BACKGROUND OF THE INVENTION 

5 

1. Field of the Invention 

The present invention relates to the field of data processing. More 
specifically, the present invention relates to the copying or archiving of web sites or 
web based applications that include files that are part of a file system as well as 
|10 non-file system structures. 

2. Background Information 

^ With advances in integrated circuit, microprocessor, networking and 

□ communication technologies, increasing number of devices, in particular, digital 
iU5 computing devices, are being networked together. As a result of this trend of 

□ increased connectivity, increasing number of applications that are network 
dependent are being deployed. Examples of these network dependent applications 
include but are not limited to, email, net-based telephony, world wide web (WWW) 
and various types of web based e-commerce, commonly referred to as web sites or 

20 web based applications (hereinafter, simply web based applications). Further, 
increasing number of software applications that were traditionally licensed or 
distributed through discrete distribution medium, such as diskettes, CDROMs and 
the like, are being distributed online or offered as web based applications, through 
private intranets or public networks like the Internet. 

25 Modern web based applications often include numerous files that are part of 

a file system as well as a substantial number of non-file system structures. 
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Examples of these non-file system structures include but are not limited data tables 
of the web based applications, schemas of the data tables, users of the web based 
applications, and so forth (see e.g. top portion of Fig. 1). Often times, whether it is 
for the purpose of backing up a web based application, for porting an application (or 
5 a portion thereof) or other reasons of the like, it is desirable to be able to efficiently 
copy/archive the application (or the portion of interest). 

Under the prior art, the various parts are typically copied or archived as 
separate pieces in a database and/or operating system dependent manner. For 
examples, a file system utility would be employed to copy/archive the files, whereas 
II 0 a database utility would be employed to copy/archive the data tables and their 
M schemas. Accordingly, under the prior art, the copying/archiving process is 
al cumbersome, inefficient, as well as being database and operating system 
SI dependent. 

Q Thus, a more efficient approach to copying/archiving a web base application, 

ill 5 in particular, an approach that is database and operating system independent, is 
?\ desired. 
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SUMMARY OF THE INVENTION 

A web based application having non-file system structures is copied or 
archived by first initializing a file to store the web based application, including 
5 creation of a root directory within the file. Then, data directories are created under 
the root directory, and a number of storage data objects under the data directories 
are initialized for all non-file system structures of the web based application. 
Thereafter, the non-file system structures are copied and stored into the storage 
data objects- In embodiments where the web based application further includes 
JjO files that are part of a file system, the files are stored as additional storage data 
^ objects under the root directory. 

\ JJ Accordingly, improved usability, efficiency, as well as database and operating 

"I system independence are achieved. 

;U5 
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BRIEF DESCRIPTION OF DRAWINGS 

The present invention will be described by way of exemplary embodiments, 
but not limitations, illustrated in the accompanying drawings in which like references 
5 denote similar elements, and in which: 

Figure 1 illustrates an overview of a typical modern web based application 
and the copying/archiving utility of the present invention, in accordance with one 
embodiment; 

Figure 2 illustrates the relevant operational flow of the copying/archiving 
£10 utility of Fig 1 , in support of the copying/archiving process of the present invention; 
M Figure 3 illustrates the organization of the resulting database and operating 

ill system independent copy of the web based application of Fig. 1, in accordance with 
\\ one embodiment; 

r| Figure 4 illustrates the relevant operational flow of the copying/archiving 

ill 5 utility of Fig 1, in support of the copying/restoring process of the present invention; 
^ Figure 5 illustrates an internal component view of a computer system 

suitable for use to practice the present invention, in accordance with one 

embodiment; 

Figure 6 illustrates an example structure description of a web based 
20 application being copied/archived; 

Figures 7a-7b illustrate an example data table schema of a data table of a 
web based application being copied/archived; and 

Figure 8 illustrates an example data table of a web based application being 
copied/archived. 
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DETAILED DESCRIPTION OF THE INVENTION 

In the following description, various aspects of the present invention will be 
described. However, it will be apparent to those skilled in the art that the present 
invention may be practiced with only some or all aspects of the present invention. 
For purposes of explanation, specific numbers, materials and configurations are set 
forth in order to provide a thorough understanding of the present invention. However, 
it will also be apparent to one skilled in the art that the present invention may be 
practiced without the specific details. In other instances, well known features are 
omitted or simplified in order not to obscure the present invention. 

Parts of the description will be presented in terms of operations performed by a 
processor based device, using terms such as data, table, schema, log-in names, 
initializing, creating, storing, retrieving, determining, and the like, consistent with the 
manner commonly employed by those skilled in the art to convey the substance of 
their work to others skilled in the art. As well understood by those skilled in the art, 
the quantities take the form of electrical, magnetic, or optical signals capable of being 
stored, transferred, combined, and otherwise manipulated through mechanical and 
electrical components of the processor based device; and the term processor include 
microprocessors, micro-controllers, digital signal processors, and the like, that are 
standalone, adjunct or embedded. Further, for the purpose of this application, the 
terms "web site" and "web based application" should be considered synonymous and 
interchangeable. 

Various operations will be described as multiple discrete steps in turn, in a 
manner that is most helpful in understanding the present invention, however, the 
order of description should not be construed as to imply that these operations are 
necessarily order dependent. In particular, these operations need not be performed 
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in the order of presentation. Further, the description repeatedly uses the phrase "in 
one embodiment", which ordinarily does not refer to the same embodiment, although 
it may. 

5 Overview 

Referring now to Figure 1, wherein a block diagram illustrating an overview of 
a web based application, the copy/archive utility of the present invention, and the 
resulting copy, in accordance with one embodiment, is shown. As illustrated and 
r% alluded to earlier, web based application 100 includes files 102 that are part of file 

system 122 of an operating system (not shown), and non-file system structures 104- 
:JJ 108. At least some of files 102 have associated access control lists (ACL). Non-file 
system structures 104-108 include in particular data tables 104 and their schemas 
106 of database system 124. Further, non-file system structures 104-108 include 
;3 other control structures 108. Of particular interest among data tables 106 is address 
^15 table 107, and among control structures 108 is user list 109. The number and 
3 content of files 102 as well as data tables 104 and their schemas 106 are application 
dependent. Their internal organizations are of course operating system and 
database dependent. That is, if web based application 100 is deployed on operating 
system OS-1 employing database DB-1, the internal organization of files 102 and 
20 data tables 104 (and their schemas 106) would be different, had web based 

application 100 been deployed on operating system OS-2 employing database DB-2 
instead. The relevant information contained in ACLs 103 includes the access control 
information for corresponding files 102. The relevant information contained in user 
list 109 includes authorized users of web based application 100. The relevant 
25 information contained in address book 107 typical includes the basic properties of the 
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users, such as their names, email addresses, and log-in user names for the domain 
web based application 100 is installed or deployed. 

As illustrated, and will be described in further detail below, copy/archive utility 
112 incorporated with the teachings of the present invention, may advantageously 
5 make a copy of web based application 100 in its entirety. Further, the internal 
organization of resulting copy 114 is independent of file system 122 and database 
124 (or their underlying operating system), on which web based application 100 is 
deployed. Accordingly, the present invention is particularly useful in facilitating 
archiving and/or porting of web based application 100. 
J{10 Note that while the preceding paragraph refers to the advantage of the present 

i; invention in copying a web based application 100 in "its entirety" (e.g. in an archiving 

situation or in a total porting situation), however it will be readily apparent from the 
"I description to follow that the present invention may also be advantageously employed 
□ to port a portion of an application, e.g. from one developer or developer vendor to 
Fiji 5 another, to be combined with another portion or portions of the application. 
q Before describing the present invention in further detail, it should be noted that 

while the preferred embodiment to be described advantageously copy web based 
application 100 in its entirety into a single file, in alternate embodiments, application 
100 may be copied into one or more files in like manner without detracting from 
20 achieving the desired database and operating system independence. 



Copying/Archiving 

Referring now to Figures 2-3, wherein two block diagrams illustrating the 
relevant operational flow of copy/archive utility 1 12 of Fig. 1 in support of the DB and 
25 OS independent manner of copying/archiving of the present invention, and the 
resulting DB and OS independent copy, respectively, in accordance with one 
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embodiment, are shown. Copy/archive utility 112 is assumed to have at least the 
capability of interacting with the operating system, more specifically, its file system 
122, and the database 124, under which web based application 100 is implemented. 
Such capabilities include e.g. the ability to request and receive data and associated 
control information from the database and the file system in the manner required by 
the database and the file system. In one embodiment, copy/archive utility 112 also 
supports an operation option for copying/archiving an application with or without the 
user information. 

As illustrated, the DB and OS independent approach to copying/archiving 

process of the present invention starts at block 202, with the creation and initialization 

of the resulting file, file 300. In preferred embodiments, resultant file 300 is a file that 

may be subjected to compression. More specifically, in various preferred 

embodiments, copy/archive utility 112 creates/initializes a "zippable" file as resultant 
file 300. 

Upon creating/initializing file 300, copy/archive utility 112 creates root directory 
302 in file 300, block 204. Thereafter, copy/archive utility 112 creates an application 
level data directory 304 under root directory 302, block 206. Next, copy/archive utility 
112 creates and initializes two storage objects 306-308 under data directory 304, 
blocks 208-210. At block 208, copy/archive utility 112 further stores a structural 
description of web based application 100 into first storage object 306. As described 
earlier, the structural description of web based application 100 enumerates the parts 
that make up application 100, such as its files, their pathnames, the data tables, their 
schemas, and so forth. In preferred embodiments, the structural description is 
expressed using a mark up language, more specifically, XML. An example of one 
such structural description 600 is shown in Fig. 6. At block 210, assuming the 
copying/archiving with user information option is selected, copy/archive utility 112 
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further stores a user description of the users of web based application 100 in the 
second storage object 308. In preferred embodiments, the users are similarly 
described employing XML (not shown). 

Thereafter, using the structural information of web based application 100 as a 
5 guide, copy/archive utility 112 creates a number of data table directories 310 under 
data directory 304, and creates and initializes a number of storage objects 312-314 
under the corresponding data table directories 310 to store data tables' data 104 and 
their schemas 106, blocks 212-216. In one embodiment, a pair of data table data 
^ and data table schema 104-106 is stored under each data table directory 310. Of 
^{10 course, in alternate embodiments, more than one pair of data table data and data 

table schema 104-106 may be stored under each data table directory 310 instead. At 
; jf blocks 214-216, copy/archive utility 112 further stores data tables' data 104 and their 
^ schemas 106 (including address table 107 if applicable), into storage objects 314 and 
;3 312 respectively. If copying/archiving of address table 107 is also included, the user 
lift 5 information are included or excluded in accordance with the operation option 
□ selected. Again, in preferred embodiments, data tables 104 and their schemas 106 
(including address table 107 if applicable), are described employing XML. 

One example each for how a data table schema 106 and a data table's data 
104 is described as they are stored into the storage objects are illustrated in Figs, 7a- 
20 7b and 8 respectively. As illustrated, the storage of a data table schema 106 includes 
identifications and descriptions of the columns of the corresponding data table, such 
as their captions, whether the columns are "hidden" and so forth. Further, the storage 
of a data table schema 106 also includes identification and descriptions of the 
associated scripts. The storage of a data table's data 104 on the other hand includes 
25 identifications and descriptions of the rows of the data table, such as their row 
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identifications, their owners (if user information are to be included), creation dates 
and so forth. 

Continuing to refer to Figs. 2-3, up6n copying/archiving data tables' data 104 
and their schemas 106, copy/archive utility 112 copies files 102 of file system 122 into 
5 file 300, placing them as data objects 316 under root directory 300, block 218. In one 
embodiment, files 102 that are not in XML format are converted into XML format. 
Thereafter, at block 220, copy/archive utility 112 further converts the ACLs of 
applicable ones of files 102 (which is normally kept in a binary format) to XML format. 
Lastly, for the copying/archiving process of the illustrated embodiment, 
fllO copy/archive utility 1 12 invokes a compression function to compress or "zip up" file 
u 300, block 222. The compression function may be any one of a number of such 
\j\ functions or utilities known in the art. 

\f Thus, it can be seen from the above description, a web based application (or 

q portion thereof, and regardless whether it is "fully" or "partially" developed) may be 
{^15 efficiently copied into a single or multiple files. Further, under the present invention, 
;» the content of file 300, once unzipped/decompressed, is advantageously humanly 
|SBa readable, as well as open to manipulation by any one of a number of common 
"editing" tools known in the art. 

20 Copying/Restoring 

Referring now to Figure 4, wherein a block diagram illustrating the relevant 
operational flow of copy/archive utility 112 of Fig. 1 in support of the DB and OS 
independent manner of copying/restoring of the present invention, in accordance with 
one embodiment, is shown. The description of the copying/restoring process 

25 illustrated in Fig. 4, will also continue to reference Fig. 3, wherein the internal 

organization of the DB and OS independent resultant file 300, in accordance with one 
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embodiment, is illustrated. Again, copy/archive utility 112 is assumed to have at least 
the capability of interacting with the operating system, more specifically, its file 
system, and the database, of the target domain, to which web based application 100 
is being copied/restored into. Such capabilities include e.g. the ability to provide data 
5 and associated control information, and request their storage, of the database and 
the file system, in the manner required by the database and the file system. In one 
embodiment, copy/archive utility 112 also supports an operation option for 
copying/restoring an application with or without the user information. Obviously, the 
option has no substantive effect if the application was not copied/archived with user 
fllO information. In a preferred embodiment, the copying/restoration still proceeds with a 
M "warning" message informing the user that user information was not present. 
;ii As shown, for the embodiment, the copying/restoring process starts at block 

M 402 with copy/archive utility 112 retrieving the structural description of web based 
□ application 100 from storage object 306 (upon unzipping or decompressing file 300, if 
;ijl5 it was zipped/compressed). Upon retrieval, at block 404, copy/archive utility 112 
determines the non-file system structures 104-108 of web based application 100, 
including the constitutions of these structures 104-108. For example, in the case of 
data tables 104, identifying their schemas 106. At block 406, copy/archive utility 112 
determines the file make-ups 104 of web based application 100, including the files' 
20 pathnames. Next, at block 408, assuming the copy/restore operation is to include 
user information and the information are present in file 300, copy/archive utility 112 
retrieves the user description of web based application 100 from storage object 308. 

Thereafter, at blocks 410^418, copy/archive utility 112 successively retrieves 
data table schemas 104 and data tables 106 (packaged in a self-describing format, 
25 such as XML) from their storage objects 312-314. For each retrieved data table 

schema 106 or data table 104, copy/archive utility 112 stores the retrieved data table 
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schema 106 or the retrieved data table 104 (in the self-describing format) in 
temporary storage (block 410). If the retrieved item is a data table schema 106, 
copy/archive utility 112 determines if the corresponding data table 104 has been 
retrieved, block 412. Likewise, if the retrieved item is a data table 104, copy/archive 
5 utility 112 determines if the corresponding data table schema 106 has been retrieved, 
block 412. In each case, if both the data table schema 106 and the corresponding 
data table 104 have been retrieved and stored into temporary storage, the data table 
104 is re-created under the database system 124 of the domain within which web 
based application 100 is being installed or restored into, and the data are imported 
IllO into the re-created table 104, block 414. Upon importing the data of the data table 
u into the re-created data table 104, copy/archive utility 112 deletes the retrieved data 
ill table schema 106 and retrieved data table 104 from their temporal storage locations, 
y block 416. Note that in alternate embodiments, data table schemas 106 and data 

tables' data 104, may be retrieved from file 300 and imported into the target 
JfjiS schemas/tables without employing intermediate temporary storage. 
\i\ Back at block 412, if the corresponding data table schema 106 or data 

^~ table 104 has not been retrieved, copy/archive utility 112 returns to block 410, as it is 
"guaranteed" that there is at least one other data item to be retrieved. On the other 
hand, if the corresponding data table schema 106 or data table 104 has been 
20 retrieved, upon performing the earlier described importation and clean up of the 
temporary storage, copy/archive utility 112 continues at block 418, wherein a 
determination is made on whether there is at least one other data item to be retrieved 
and restored. If so, the process returns to block 410, otherwise the process 
continues at block 420. 
25 At block 420, if the user list was retrieved back at block 408, the retrieved user 

list is processed. Assuming the domain to which web based application 100 is being 
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copied/restored into supports a programming interface for adding log-in users, the 
processing includes, for each user of web based application 100, copy/archive utility 
112 checking with the registration service of the target domain to determine if the 
user is a registered user of the target domain. If not, in a preferred embodiment, 
5 copy/archive utility 1 12 registers the user with the target domain. 

Next at block 422, again assuming user information are to be included, and 
they were retrieved, copy/archive utility 112 proceeds to fix up references to 
users/user objects included in data tables 104. In particular, where address book 107 
includes the log-in user names of at least some of users having address entries in 
JJilO address book 107, copy/archive utility 112 updates the log-in user names of the users 
ill of web based application 100 with the log-in user names of the users for the 

copy/restore target domain. In various embodiments, the fix-up operations also 
irj include conditionally deleting the user log-in names from address table 107, when 
;U address table 107 was initially imported. Further, in various embodiment, the fix-up 
operations also include determining if a user already has a corresponding address 
Iff entry in address table 107, if not creating the corresponding address entry in address 
- table 107. 

Thereafter, at block 422, copy/archive utility 112 retrieves files 102 from 
storage objects 316, and stores them under file system 122 of the target domain. For 

20 the illustrated embodiment, recall an ACL file is converted into an XML format during 
the "copying" process. Accordingly during the "restoration" process, copy/archive 
utility 112 converts the ACLs back into a binary format. Additionally, if any of files 102 
are to be stored in a format other than XML format, the files are converted 
accordingly. In various embodiments, the conversion requirements are described as 

25 part of the XML format. 
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Accordingly, the database and operating system independent copy 300 of web 
based application 100 may be used as a back up/restoration copy, or may be used as 
a copy to port application 100 (in whole or in part, regardless application 100 is fully 
or partially developed) to another domain of like or different execution environment. 

5 

Example Computer System 
Figure 5 illustrates an example computer system suitable for use to practice 
the present invention in accordance with one embodiment. As shown, computer 
system 500 includes one or more processors 502 and system memory 504. 
;|10 Additionally, computer system 500 includes mass storage devices 506 (such as 
j"* diskette, hard drive, CDROM and so forth), input/output devices 508 (such as 
M keyboard, cursor control and so forth) and communication interfaces 510 (such as 
\! network interface cards, modems and so forth). The elements are coupled to each 

□ other via system bus 512, which represents one or more buses. In the case of 

ill 5 multiple buses, they are bridged by one or more bus bridges (not shown). Each of 

□ these elements performs its conventional functions known in the art. In particular, 
system memory 504 and mass storage 506 are employed to store a working copy 
and a permanent copy of the programming instructions implementing the 
copy/archive utility of the present invention (or a component incorpating the 

20 copy/archive utility). The permanent copy of the programming instructions may be 
loaded into mass storage 506 in the factory, or in the field, as described earlier, 
through a distribution medium (not shown) or through communication interface 510 
(from a distribution server (not shown). The constitution of these elements 502-512 
are known, and accordingly will not be further described. 

25 

Conclusion and Epilog 
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Thus, an improved method and apparatus for efficiently copying/archiving a 
web based application has been described. While the present invention has been 
described in terms of the above illustrated embodiments, those skilled in the art will 
recognize that the invention is not limited to the embodiments described. The present 
invention can be practiced with modification and alteration within the spirit and scope 
of the appended claims. Thus, the description is to be regarded as illustrative instead 
of restrictive on the present invention. 
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